<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
  <!-- #BeginTemplate "/Templates/model3.dwt" -->
  <meta name="description"
  content="DigitalPebble Ltd is a consultancy specialising in web crawling, natural language processing, information retrieval and extraction. Our expertise is based on open source solutions, such as Nutch, GATE or SOLR.">
  <meta name="keywords" content="crawl, gate, consultant, consultancy, consulting, information extraction, information retrieval, NLP, IR, IE, nutch, solr">
  <link rel="icon" href="img/favicon.ico" type="image/vnd.microsoft.icon">
  <link type="text/css" href="style.css" rel="stylesheet">
  <meta name="google-site-verification" content="ZNIbylXN61hwJhB39tK17-u7RsU5kgiHXWbQ5F7lrNc" />
  <!-- #BeginEditable "doctitle" -->
  <title>DigitalPebble Ltd - Open Source Solutions for Text Engineering</title>
  <!-- #EndEditable -->
</head>

<a rel="nofollow" href="inexistent.html"/>

<body style="color: rgb(0, 0, 0); background-color: rgb(255, 255, 255);"
alink="#000000" link="#000000" vlink="#000000">

<table align="center" border="0" cellspacing="0" width="70%">
  <tbody>
    <tr>
      <td valign="bottom"><img src="img/logo.gif" alt="digitalpebble"
        align="bottom" height="60" width="310"></td>
    </tr>
    <tr>
      <td>&nbsp;</td>
    </tr>
    <tr>
      <td align="left" valign="middle"><!-- #BeginEditable "menu" -->

        <table border="0" cellpadding="0" cellspacing="0">
          <tbody>
            <tr>
              <td><a href="index.html"><img name="Home"
                src="./img/menu/home2.png" border="0"></a></td>
              <td>&nbsp;</td>
              <td><a href="solutions.html"><img name="Solutions"
                src="./img/menu/solutions1.png" border="0"></a></td>
              <td>&nbsp;</td>
             <td><a href="references.html"><img name="Clients"
                src="./img/menu/clients1.png" border="0"></a></td>
              <td>&nbsp;</td>
              <td><a href="contact.html"><img src="img/menu/contact1.png"
                style="border: 0px solid ; width: 108px; height: 32px;"
                alt="" name="Contact" onload=""></a></td>
              <td>&nbsp;</td>
            </tr>
          </tbody>
        </table>
        <!-- #EndEditable -->
      </td>
    </tr>
    <tr>
      <td>
        <div id="tabs">
        </div>
      </td>
    </tr>
  </tbody>
</table>

<table align="center" border="0" cellspacing="0" width="70%">
  <!-- #BeginEditable "crumbs" -->
  <tbody>
    <tr>
      <td colspan="3" align="left" valign="top">&nbsp;</td>
    </tr>
    <!-- #EndEditable -->
    <tr>
      <td valign="top" width="270"><!-- #BeginEditable "picture" -->
        <img src="img/small5.jpg"
        alt=""
        height="182" width="255"><!-- #EndEditable -->
      </td>
      <td valign="top" width="50">&nbsp;</td>
      <td align="left" valign="top" width="*"><!-- #BeginEditable "text" -->
        <p class="aligned"><span class="concept">DigitalPebble Ltd</span>
        is a consultancy and solution provider specialising in web crawling, natural language processing, 
        document retrieval and information extraction.</p>

        <p class="aligned">We advise on, evaluate and implement systems built
        on leading <a href="solutions.html">open source solutions</a>, such
        as <a href="http://nutch.apache.org/">Apache Nutch</a>,
        <a href="http://gate.ac.uk">GATE</a> or <a href="http://lucene.apache.org/solr">SOLR</a>. We aim to combine open
        source tools to provide efficient, reliable and low-cost
        made-to-order solutions.</p>

        <p class="aligned">Our unique expertise covers all aspects of the
        document life cycle, from web-wide crawling and collection through content
        analysis, filtering and categorization to indexing. We
        specialise in large-scale processing using <a
        href="http://hadoop.apache.org/">Hadoop</a> or <a href="http://storm.apache.org/">Storm</a> and have expertise in cloud platforms such as Amazon AWS, which has allowed
        us to deploy solutions that scale to billions of documents for our <a href="references.html">clients</a>.</p>
        
        <p class="aligned">Not only do we have extensive knowledge of open source solutions, we are also active contributors and
        provide some of the <a href="https://github.com/DigitalPebble">resources</a> that we have developed over the years under open source licenses.
         </p>
        
        <p class="aligned">Our <a href="references.html">clients</a> range from startups in stealth mode to NASDAQ-listed companies and operate in domains as varied as business intelligence, media monitoring,
        telecommunications and software development.
        </p>
        
      </td>
      
    </tr>
  </tbody>
</table>
<script type="text/javascript"
src="http://www.google-analytics.com/urchin.js">
</script>
<script type="text/javascript">
_uacct = "UA-357582-1";
urchinTracker();</script>
<!-- #EndTemplate -->
</body>
</html>
